ItAW SEQUENCE LISTING 
EltftOR REPORT 





S*^J*****^"«*ogy Systems Branch of the Scientific and Technical Inform>iti«ii % ^ 
W ^ ClItttMP (STIC) detected errors when processing the foUoTving computer readable 
^ ^ , form: 



•Application Serial Number: 
Source: 

Date Processed by STIC: 



m\c> 



\ ^kl^ ' ' ™^ ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

/ .^'Ife V '•WSASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

/ K??^™^^^^^®^™'^^^*™^®^'^ YOUR NEXT COMMUmCATIOW^^ 

APPLICANT. WITH A NOTICE TO COMPLY or, lU ttia 

^' ^ ^^|b?^S^^^'^®aco'yofthispri^ 

pr^ submission QUESTION 




FORSEQUENCE RULES INTERPM:TATI0N, PLEASE CONTACT ROBERT WAX, 703-30JM216. 
PATENTIN 2.1 e-mail help: patinZl help(%uspto.gov or phone 703-306-41 19 (R. Wax) 
PATENTIN3.0e-malI help: patin3help@uspto.^ov or phone 703-306-4119 (R. Wax) 

,T0 REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/ofnces/pac/checker 

AppIiamto siAimtting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 
8 possibiUty that the disk/CD-Rom may ha^en affected by treatment given to all incoming mail T 
Please consider usmg alternate methods o«SiBmission for the disk/CD-Rom or replacement disk/CD-Roth. 



i^lS^I? "'^ ' sequence listing in elMronic form should NOT b e sent to the 20231 zip code add««s for ? 
: ^f^^^'^''^ ""^ Trademark Office, and inste a d should be sent via the following to the indicated^>«««.. 

^ ^^•^^www.nspt o.gov^b<^^ , EBS Swi^S&rt^ 

toii3i«f«^ai.cPAVE) ^ ^^^^^m 

, .^-^ , V Patent and Trademark Office. Technology Center 1600. Reception Area, 7* Floor, Examiner Name. 

^SSsf^^^il'^ ''^•«*-'*--nt«iMntadeniark Offic*. Bdt Spcjiwiice, Customer Window, Lobby. Room 1B03 Cmtoii»MWolJ i 





j^lace, Arlto^gtoo, VAl^ 



er deUvery service to: U.S. Ptteat aod mBi^ 



iB03-Mailn>om, Cr:^ Two, 201 1 South Claik Plwe, Ariin^toii. VA 

12 




RAW SEQUENCE LISTING DATE: 07/11/2002 

PATENT APPLICATION: US/10/009,792 TIME: 14:06:50 



Input Set : A:\seqlist.txt 

Output Set: N:\CRr3\07I12002\J009792.raw 

3 <110> APPLICANT: Korea Advanced Institute of Science and Technology 

5 <120> TITLE OF INVENTION: E coli Secreting Human 

6 Granulocyte Colony Stimulating Factor ( hG - CSF ) 
8 <130> FTLK RFFFRFNCE: PO 1 3 2/KAIST/PCT 

C--> 10 <140> CURRENT APPLICATION NUMBER: US/10/0 0 9,7 92 

C--> 10 <141> CURRENT FILING DATE: 2001-12-13 

.10 <150> PRIOR APPLICATION NUMBER: KR 2000-0017052 

11 <151> PRIOR FILING DATE: 2000-03-31 

13 <160> NUMBER OF SEQ ID NOS : 27 

15 <170> SOFTWARE: KopatentIn 1.71 

ERRORED SEQUENCES , 







280 


<210> SEQ ID NO 


19 




























281 


<211> LENGTH: 174 














^rr^' ■ 












282 


<212> TYPE: 


PRT 
















V 














283 


<213> ORGANISM: 


Homo sapiens 


/ 


















285 


<400> SEQUENCE: 


19 
























286 


Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


E- 


-> 


289 


Cys 


Leu 


Glu 


Gin 


Val 


Arg 


Lys 


He 


Gin 


Gly 


Asp 


Gly 


Ala 


Ala 


Leu 


Gin 


E- 


-> 


292 


Glu 


Lys 


Leu 


Cys 


Ala 


Thr 


Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


Val 


E- 


-> 


295 


Leu 


Leu 


Gly 


His 


Ser 


Leu 


Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


Cys 


E- 


-> 


298 


Pro 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


Ser 


E- 


- > 


301 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


Ser 


E- 


- > 


304 


Pro 


Glu 


Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


Asp 


E- 


- > 


307 


Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


Met 


Ala 


Pro 


E- 


-> 


310 


Ala 


Leu 


Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Ala 


Ser 


Ala 


Phe 


E- 


-> 


313 


Gin 


Arg 


Arg 


Ala 


Gly 


Gly 


Val 


Leu 


Val 


Ala 


Ser 


His 


Leu 


Gin 


Ser 


Phe 


E- 


-> 


316 


Leu 


Glu 


Val 


Ser 


Tyr 


Arg 


Val 


Leu 


Arg 


His 


Leu 


Ala 


Gin 


Pro 










345 


<210> SEQ ID NO 


21 




























346 


<211> LENGTH: 175 




























347 


<212> TYPE: 


PRT 






























348 


<213> ORGANISM: 


Homo sapiens 






















■35:0 


<400> SEQUENCE: 


21 




























351 


Met 


Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


Ser 


Leu 


Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


E- 


-> 


354 


Lys 


Cys 


Leu 


Glu 


Gin 


Val 


Arg 


Lys 


He 


Gin 


Gly 


Asp 


Gly 


Ala 


Ala 


Leu 


E- 


-> 


3S7 


Gin 


Glu 


Lys 


Leu 


Cys 


Ala 


Thr 


Tyr 


Lys 


Leu 


Cys 


His 


Pro 


Glu 


Glu 


Leu 


E- 


- > 


360 


Val 


Leu 


Leu 


Gly 


His 


Ser 


Leu 


Gly 


He 


Pro 


Trp 


Ala 


Pro 


Leu 


Ser 


Ser 


E- 


- > 


363 


Cys 


Pro 


Ser 


Gin 


Ala 


Leu 


Gin 


Leu 


Ala 


Gly 


Cys 


Leu 


Ser 


Gin 


Leu 


His 


E- 


- > 


366 


Ser 


Gly 


Leu 


Phe 


Leu 


Tyr 


Gin 


Gly 


Leu 


Leu 


Gin 


Ala 


Leu 


Glu 


Gly 


He 


E- 


-> 


369 


Ser 


Pro 


Glu 


Leu 


Gly 


Pro 


Thr 


Leu 


Asp 


Thr 


Leu 


Gin 


Leu 


Asp 


Val 


Ala 


E- 


-> 


372 


Asp 


Phe 


Ala 


Thr 


Thr 


He 


Trp 


Gin 


Gin 


Met 


Glu 


Glu 


Leu 


Gly 


Met 


Ala 



rile://C:\CRF3\Outhol(f\VsrJ009792.htm 



RAW SEQUENCE LISTING 

t^ATENT APPLICATION: US/10/00 9,792 



DATE: 07/1 1/2002 
TIME: 14:06:50 



Input Set. : A : \seqlist . txt 

Output Set: N:\CRF3\G7112002\J009792.raw 



E- 


-> 


375 


Pro Ala Leu Gin 


Pro 


Thr 


Gin 


Gly 


Ala 


Met 


Pro 


Ala 


Phe 


Ala 


Ser 


Ala 


E- 


-> 


378 


Phe Gin Arg Arg 


Ala 


Gly 


Gly 


Val 


Leu 


Val 


Ala 


Ser 


His 


Leu 


Gin 


Ser 


E- 


-> 


381 

417 


Phe Leu Glu Val 

<210> SEQ ID NO: 


Ser 

25 


Tyr 


Arg 


Val 


Leu 


Arg 


His 


Leu 


Ala 


Gin 


Pro 








418 


<211> LENGTH: 4 5 




























419 


<212> TYPE: PKT 






























420 


<213> ORGANISM: 


Homo sapiens 






















422 


<4 00> SEQUENCE: 


25 




























423 


Met Phe Lys Phe 


Lys 


Lys 


Lys 


Phe 


Leu 


Val 


Gly 


Leu 


Thr 


Ala 


Ala 


Phe 


E- 


-> 


426 


Met Ser lie Ser 


Met 


Phe 


Ser 


Ala 


Thr 


Ala 


Ser 


Ala 


Thr 


Pro 


Leu 


Gly 


E- 


-> 


429 

446 


Pro Ala Ser Ser 

<210> SEQ ID NO: 


Leu 

27 


Pro 


Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


Cys 












447 


<211> LENGTH: 60 




























448 


<212> TYPE: PRT 






























449 


<213> ORGANISM: 


Home 


1 sapiens 






















451 


<400> SEQUENCE: 


27 




























452 


Met Phe Lys Phe 


Lys 


Lys 


Lys 


Phe 


Leu 


Val 


Gly 


Leu 


Thr 


Ala 


Ala 


Phe 


E- 


-> 


455 


Met Ser lie Ser 


Met 


Phe 


Ser 


Ala 


Thr 


Ala 


Ser 


Ala 


Ala 


Gly 


Pro 


His 


E- 


-> 


458 


His His His His 


His 


He 


Glu 


Gly 


Arg 


Thr 


Pro 


Leu 


Gly 


Pro 


Ala 


Ser 


E- 


-> 


461 


Ser Leu Pro Gin 


Ser 


Phe 


Leu 


Leu 


Lys 


Cys 


Leu 


Glu 










E- 


-> 


469 




























E- 


-> 


472 


\6 ' V A,l .Jv 
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Page 3 of 4 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/0 0 9,7 92 



DATE: 07/11/2002 
■I'lMh': 14:06:51 



Input Set A:\seqlist.txt 

Output Set: N:\CRF3\07112002\J009792.raw 



L:10 M:270 C: Current Application Number dittors, Replaced Current Application No 
I,:10-M:27] C: Current I'ilinq Date differs, Replaced Current Filing Date 
L:289 M:3J2 K: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 19 
M:332 Repeated in SeqNo=19 

L;354 M;332 F. : (32) I nva i id/Mi ssing Amino Acid Numbering, SEQ ID: 21 
M:332 Repeated in SeqNo=21 

L:426 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:25 
M:332 Repeated in SeqNo=25 

L:455 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:27 
M:332 Repeated in SeqNo=27 



file://C:\CRF3\Outhol(J\VsrJ009792.htm 



7/11/02 



Raw Sequence Lbting Error Summary 



F.RWOR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 1^' I ^'C^ i , '/ V 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WIHCH WERE INSERTED BY PTO SOFTWARE 



1 



10 



11 



12 



_Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word processor aner creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

_Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes while spaces. 



_MiijiliBiiiJAmino 
Numbering 

Non-ASCII 



Variable Length 



Patentln 2.0 
"bug" 



_Sldpped Sequences 
(OLD RULES) 



_Sldpped Sequences 
(NEW RULES) 



_Use of n's or Xaa's 
(NEW RULES) 



_Imralid<213> 
Response 



Useof<220> 



.Patentln 2.0 
"bug" 



The numbering under each 5"' amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved In ASCII text. 

Scqucnce(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n orXaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies lo the mandatory <220>-<223> sections for 
Artificial or Unlcnown sequences. 



Sequence(s) 



missing If intentional, please insert the following lines for each skipped sequence: 



(2) rNFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO X: (insert SEQ ID NO where "X" is shotvn) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



Sequence<s) 



missing. If intentional, please insert the following lines for each skipped sequence. 



<2 10> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> lo <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 ofSequenceRilles, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequence(s) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> sectioa 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resuhing in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/21/2001 



